Sparse Cut Projections in Graph Streams

Authors

  • Atish Das Sarma
  • Sreenivas Gollapudi
  • Rina Panigrahy
Abstract

Finding sparse cuts is an important tool for analyzing large graphs that arise in practice, such as the web graph, online social communities, and VLSI circuits. When such graphs have billions of nodes, it is often hard to visualize global partitions. While studies of sparse cuts have traditionally considered cuts with respect to all the nodes in the graph, some recent works analyze graph properties projected onto a small subset of vertices that may be of interest in a given context, e.g., the documents relevant to a query in a search engine. In this paper, we study how sparse cuts in a graph partition a certain subset of nodes. We call this partition a cut projection. We study the problem of finding cut projections in the streaming model, which is appropriate in this context because the input graph is too large to store in main memory. Specifically, for a d-regular graph G on n nodes with a cut of conductance Φ and constant balance, we show how to partition a randomly chosen set of k nodes in Õ(1/√(αΦ)) passes over the graph stream and space Õ(nα + n k √ αΦ19/4 ), for any choice of α ≤ 1. The resulting partition is the projection of a cut of conductance at most Õ(√Φ). We note that for k < nαΦ, this can be done in Õ(1/√(αΦ)) passes and space Õ(nα), which is sublinear in the number of nodes.
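
To make the two central notions concrete, the following is a minimal Python sketch, not the paper's streaming algorithm. It assumes the standard definition of conductance (crossing edges divided by the smaller of the two side volumes) and treats a cut projection as the restriction of a cut to a chosen subset of nodes. The function names `conductance` and `cut_projection` and the 8-node cycle example are illustrative assumptions, not taken from the paper.

```python
from typing import Hashable, Iterable, Set, Tuple

Edge = Tuple[Hashable, Hashable]


def conductance(edges: Iterable[Edge], side: Set[Hashable]) -> float:
    """Conductance of the cut (side, rest), computed in one pass over the edges.

    Assumes the standard definition: crossing edges / min(vol(side), vol(rest)),
    where the volume of a set is the sum of the degrees of its nodes.
    """
    crossing = 0
    vol_in = 0
    vol_out = 0
    for u, v in edges:  # a single pass, as over an edge stream
        for x in (u, v):
            if x in side:
                vol_in += 1
            else:
                vol_out += 1
        if (u in side) != (v in side):
            crossing += 1
    smaller = min(vol_in, vol_out)
    if smaller == 0:
        raise ValueError("one side of the cut has zero volume")
    return crossing / smaller


def cut_projection(side: Set[Hashable], subset: Iterable[Hashable]):
    """Restrict the cut (side, rest) to the nodes in `subset`."""
    chosen = set(subset)
    inside = chosen & side
    return inside, chosen - inside


# Hypothetical example: an 8-node cycle (2-regular), cut into two halves.
cycle_edges = [(i, (i + 1) % 8) for i in range(8)]
left_side = {0, 1, 2, 3}

phi = conductance(cycle_edges, left_side)            # 2 crossing edges / volume 8
projection = cut_projection(left_side, [1, 2, 5])    # subset of k = 3 nodes
```

In this toy case the cut has conductance 0.25, and its projection onto the subset {1, 2, 5} is the partition ({1, 2}, {5}). The sketch stores the whole cut in memory, which is exactly what the streaming setting in the abstract is designed to avoid.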

Similar resources

Sparse Cut Projections on Graph Streams

Finding sparse cuts is an important tool for analyzing/partitioning large graphs that arise in practice, such as the web graph, online social communities, and VLSI circuits. A well-developed framework for working with such large graphs is the streaming model, wherein the input is assumed to be on disk, and any algorithm is allowed to make few passes over the input that may be too large to stor...

Random Projections, Graph Sparsification, and Differential Privacy

This paper initiates the study of preserving differential privacy (DP) when the data set is sparse. We study the problem of constructing an efficient sanitizer that preserves DP and guarantees high utility for answering cut queries on graphs. The main motivation for studying sparse graphs arises from the empirical evidence that social networking sites are sparse graphs. We also motivate and advoc...

Tight Bounds for Graph Problems in Insertion Streams

Despite the large amount of work on solving graph problems in the data stream model, there do not exist tight space bounds for almost any of them, even in a stream with only edge insertions. For example, for testing connectivity, the upper bound is O(n log n) bits, while the lower bound is only Ω(n) bits. We remedy this situation by providing the first tight Ω(n log n) space lower bounds for rand...

Face Recognition using an Affine Sparse Coding approach

Sparse coding is an unsupervised method that learns a set of over-complete bases to represent data such as images and video. Sparse coding has attracted increasing interest for image classification applications in recent years. But in cases where we have similar images from different classes, such as face recognition applications, different images may be classified into the same class, and hen...

Analyzing Massive Graphs in the Semi-streaming Model

Massive graphs arise in many scenarios, for example, traffic data analysis in large networks, large-scale scientific experiments, and clustering of large data sets. The semi-streaming model was proposed for processing massive graphs. In the semi-streaming model, we have a random-access memory which is near-linear in the number of vertices. The input graph (or equivalently, edges in the gr...

Publication date: 2009